Monaural Speech Enhancement Using a Multi-Branch Temporal Convolutional Network

نویسندگان

چکیده

Deep learning has achieved substantial improvement on single-channel speech enhancement tasks. However, the performance of multi-layer perceptions (MLPs)-based methods is limited by ability to capture long-term effective history information. The recurrent neural networks (RNNs), e.g., long short-term memory (LSTM) model, are able temporal dependencies, but come with issues high latency and complexity training.To address these issues, convolutional network (TCN) was proposed replace RNNs in various sequence modeling In this paper we propose a novel TCN model that employs multi-branch structure, called (MB-TCN), for monaural enhancement.The MB-TCN exploits split-transform-aggregate design, which expected obtain strong representational power at low computational complexity.Inspired TCN, incorporates one dimensional causal dilated CNN residual expand receptive fields capturing contextual information.Our extensive experimental investigation suggests MB-TCNs outperform (ResLSTMs), (TCNs), employ dense aggregations terms intelligibility quality, while providing superior parameter efficiency. Furthermore, our results demonstrate multiple state-of-the-art deep learning-based five widely used objective metrics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models

A Pascal challenge entitled monaural multi-talker speech recognition was developed, targeting the problem of robust automatic speech recognition against speech like noises which significantly degrades the performance of automatic speech recognition systems. In this challenge, two competing speakers say a simple command simultaneously and the objective is to recognize speech of the target speake...

متن کامل

A Fully Convolutional Neural Network for Speech Enhancement

In hearing aids, the presence of babble noise degrades hearing intelligibility of human speech greatly. However, removing the babble without creating artifacts in human speech is a challenging task in a low SNR environment. Here, we sought to solve the problem by finding a ‘mapping’ between noisy speech spectra and clean speech spectra via supervised learning. Specifically, we propose using ful...

متن کامل

Multi-Branch Fully Convolutional Network for Face Detection

Face detection is a fundamental problem in computer vision. It is still a challenging task in unconstrained conditions due to significant variations in scale, pose, expressions, and occlusion. In this paper, we propose a multi-branch fully convolutional network (MB-FCN) for face detection, which considers both efficiency and effectiveness in the design process. Our MB-FCN detector can deal with...

متن کامل

Deep Recurrent Neural Network Based Monaural Speech Separation Using Recurrent Temporal Restricted Boltzmann Machines

This paper presents a single-channel speech separation method implemented with a deep recurrent neural network (DRNN) using recurrent temporal restricted Boltzmann machines (RTRBM). Although deep neural network (DNN) based speech separation (denoising task) methods perform quite well compared to the conventional statistical model based speech enhancement techniques, in DNN-based methods, the te...

متن کامل

SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement

This paper proposes a signal-to-noise-ratio (SNR) aware convolutional neural network (CNN) model for speech enhancement (SE). Because the CNN model can deal with local temporal-spectral structures of speech signals, it can effectively disentangle the speech and noise signals given the noisy speech signals. In order to enhance the generalization capability and accuracy, we propose two SNR-aware ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Social Science Research Network

سال: 2022

ISSN: ['1556-5068']

DOI: https://doi.org/10.2139/ssrn.4108416